Fast word-graph generation for spontaneous conversational speech translation
Abstract
This paper introduces the latest advances in research at ATR on speech translation for spontaneous conversations, focusing especially on speech recognition efforts. For recognition, we employ a word search technique that generates moderately sized word graphs in real time. To cope with the varied lengths of utterances in spontaneous speech (e.g., word, phrase, sentence fragment, sentence, and concatenated sentences), we have adopted a two-pass search strategy that uses variable-order word n-gram statistics in the first stage and task-dependent language constraints in the second stage. This strategy is evaluated using the "ATR Travel Arrangement" corpus.
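The two-pass idea can be illustrated with a toy sketch. This is not the ATR system: the word graph, all scores, the bigram table (a stand-in for the variable-order n-gram statistics) and the `TASK_PENALTY` table (a stand-in for the task-dependent language constraints) are hypothetical values chosen for illustration. The example also enumerates all paths and rescores a short N-best list, which a real-time decoder would not do.

```python
# Toy word graph: node -> list of (next_node, word, acoustic_log_score).
# Every word and score here is invented for illustration.
GRAPH = {
    0: [(1, "i", -1.0), (1, "a", -1.2)],
    1: [(2, "want", -1.0), (2, "won't", -0.9)],
    2: [(3, "room", -1.0), (3, "rooms", -1.0)],
    3: [],
}
START, END = 0, 3

# Hypothetical bigram log-probabilities used in the first pass.
BIGRAM = {
    ("<s>", "i"): -0.5,
    ("i", "want"): -0.5,
    ("i", "won't"): -0.7,
    ("want", "room"): -1.0,
    ("want", "rooms"): -0.8,
    ("won't", "room"): -1.5,
    ("won't", "rooms"): -1.5,
}
UNSEEN = -3.0  # flat score for bigrams missing from the table

# Hypothetical second-pass constraint: this task disfavors "want rooms".
TASK_PENALTY = {("want", "rooms"): -1.0}


def paths(node, end):
    """Yield every path from node to end as a list of (word, acoustic_score)."""
    if node == end:
        yield []
        return
    for nxt, word, score in GRAPH[node]:
        for rest in paths(nxt, end):
            yield [(word, score)] + rest


def first_pass_score(path):
    """Acoustic score plus bigram language-model score."""
    words = ["<s>"] + [w for w, _ in path]
    acoustic = sum(s for _, s in path)
    lm = sum(BIGRAM.get(pair, UNSEEN) for pair in zip(words, words[1:]))
    return acoustic + lm


def second_pass_score(path):
    """First-pass score plus task-dependent penalties."""
    words = [w for w, _ in path]
    penalty = sum(TASK_PENALTY.get(pair, 0.0) for pair in zip(words, words[1:]))
    return first_pass_score(path) + penalty


def decode(n_best=2):
    """Return (first-pass best words, second-pass best words)."""
    # Pass 1: keep only the n_best hypotheses under acoustic + bigram scores.
    candidates = sorted(paths(START, END), key=first_pass_score, reverse=True)[:n_best]
    # Pass 2: rescore the survivors with the task-dependent constraints.
    best = max(candidates, key=second_pass_score)
    return [w for w, _ in candidates[0]], [w for w, _ in best]


if __name__ == "__main__":
    first, final = decode()
    print("first pass :", " ".join(first))
    print("second pass:", " ".join(final))
```

In this toy setup the first pass prefers "i want rooms", while the second pass, after applying the assumed task penalty, selects "i want room". The point is only that a cheap statistical pass narrows the hypothesis space before stricter constraints are applied.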
Similar resources
Pronunciation variant analysis using speaking style parallel corpus
To improve the recognition accuracy for spontaneous conversational speech, we collected a corpus to study how spontaneous conversational speech differs from read style speech. The corpus consists of two parts: 1) spontaneous conversational speech and 2) read speech with the same word transcriptions as the conversational speech. In word and phone recognition experiments, it was confirmed that, f...
The ISL evaluation system for Verbmobil-II
This paper describes the 2000 ISL large vocabulary speech recognition system for fast decoding of conversational speech which was used in the German Verbmobil-II project. The challenge of this task is to build robust acoustic models to handle different dialects, spontaneous effects, and crosstalk as occur in conversational speech. We present speaker incremental normalization and adaptation experi...
Interactive Translation of Conversational Speech
We present JANUS-II, a large scale system effort aimed at interactive spoken language translation. JANUS-II now accepts spontaneous conversational speech in a limited domain in English, German or Spanish and produces output in German, English, Spanish, Japanese and Korean. The challenges of coarticulated, disfluent, ill-formed speech are manifold, and have required advances in acoustic modeling...
ISIP 2000 Conversational Speech Evaluation System
In this paper, we describe the ISIP Automatic Speech Recognition system (ISIP-ASR) used for the Hub-5 2000 English evaluations. The system is a public domain cross-word context-dependent HMM based system and has all the functionality normally expected in an LVCSR system, including Baum-Welch training for continuous density HMMs, phonetic decision tree-based state-tying, word graph generation an...
A language model for conversational speech recognition using information designed for speech translation
In this paper, a new language model is proposed for speech recognition in conversational speech translation. In conversation, speech strongly depends on the previous utterance of the other participant. Applying this dependency in language modeling, we can reduce the speech recognition error rate. To this end, we propose the following new language model where the content of the previous utteranc...
Publication date: 1997